Minimising Regret in Route Choice

نویسنده

  • Gabriel de Oliveira Ramos
چکیده

The use of reinforcement learning (RL) in multiagent scenarios is challenging. I consider the route choice problem, where drivers must choose routes that minimise their travel times. Here, selfish RL-agents must adapt to each others’ decisions. In this work, I show how the agents can learn (with performance guarantees) by minimising the regret associated with their decisions, thus achieving the User Equilibrium (UE). Considering the UE is inefficient from a global perspective, I also focus on bridging the gap between the UE and the system optimum. In contrast to previous approaches, this work drops any full knowledge assumption.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning to Minimise Regret in Route Choice

Reinforcement learning (RL) is a challenging task, especially in highly competitive multiagent scenarios. We consider the route choice problem, in which self-interested drivers aim at choosing routes that minimise their travel times. Employing RL here is challenging because agents must adapt to each others’ decisions. In this paper, we investigate how agents can overcome such condition by minim...

متن کامل

Risk Route Choice Analysis and the Equilibrium Model under Anticipated Regret Theory

The assumption about travellers’ route choice behaviour has major influence on the traffic flow equilibrium analysis. Previous studies about the travellers’ route choice were mainly based on the expected utility maximization theory. However, with the gradually increasing knowledge about the uncertainty of the transportation system, the researchers have realized that there is much constraint in ...

متن کامل

On Estimating Action Regret and Learning From It in Route Choice

The notion of regret has been extensively employed to measure the performance of reinforcement learning agents. The regret of an agent measures how much worse it performs following its current policy in comparison to following the best possible policy. As such, measuring regret requires complete knowledge of the environment. However, such an assumption is not realistic in most multiagent scenar...

متن کامل

Relationship between impulsive choice and emotional distress with students’ procrastination: mediating role of anticipated regret and consideration of future consequences

Given the wide prevalence of procrastination and delaying tasks and the need to identify factors affecting this problem, present study aimed to investigate the mediating role of anticipated regret and consideration of future consequences in the relationship between impulsive choice and emotional distress with procrastination. In an analytical cross-sectional study, 400 students were selected th...

متن کامل

The Tyranny Of Choice: A Cross-Cultural Investigation Of Maximizing-Satisficing Effects On Well-Being

The present research investigated the relationship between individual differences in maximizing versus satisficing (i.e., seeking to make the single best choice, rather than a choice that is merely good enough) and well-being, in interaction with the society in which an individual lives. Data from three distinct cultural groups (adults), drawn respectively from the U.S. (N=307), Western Europe ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017